Regular Functions, Cost Register Automata, and Generalized 

Min-Cost Problems 



Rajeev Alur Loris D'Antoni Jyotirmoy V. Deshmukh Mukund Ragothaman 

Yifei Yuan 

February 23, 2012 



Abstract 

Motivated by the successful application of the theory of regular languages to formal verifi- 
cation of finite-state systems, there is a renewed interest in developing a theory of analyzable 
functions from strings to numerical values that can provide a foundation for analyzing quan- 
titative properties of finite-state systems. In this paper, we propose a deterministic model for 
associating costs with strings that is parameterized by operations of interest (such as addition, 
scaling, and min), a notion of regularity that provides a yardstick to measure expressiveness, 
and study decision problems and theoretical properties of resulting classes of cost functions. Our 
definition of regularity relies on the theory of string-to-tree transducers, and allows associating 
costs with events that are conditional upon regular properties of future events. Our model of 
cost register automata allows computation of regular functions using multiple "write-only" reg- 
isters whose values can be combined using the allowed set of operations. We show that classical 
shortest-path algorithms as well as algorithms designed for computing discounted costs, can be 
adopted for solving the min-cost problems for the more general classes of functions specified 
in our model. Cost register automata with min and increment give a deterministic model that 
is equivalent to weighted automata, an extensively studied nondeterministic model, and this 
connection results in new insights and new open problems. 



1 Introduction 



1.1 Motivation 

The classical shortest path problem is to determine the minimum-cost path in a finite graph whose 
edges are labeled with costs from a numerical domain. In this formulation, the cost at a given 
step is determined locally, and this does not permit associating alternative costs in a speculative 
manner. For example, one cannot specify that "the cost of an event e is 5, but it can be reduced 
to 4 provided an event e' occurs sometime later." Such a constraint can be captured by the well- 
studied framework of weighted automata |32] 114] . A weighted automaton is a nondeterministic 
finite-state automaton whose edges are labeled with symbols in a finite alphabet S and costs in 
a numerical domain. Such an automaton maps a string w over £ to the minimum over costs of 
all accepting paths of the automaton over w. There is extensive literature on weighted automata 
with applications to speech and image processing [27]. Motivated by the successful application of 
the theory of regular languages to formal verification of finite-state systems, there is a renewed 
interest in weighted automata as a plausible foundation for analyzing quantitative properties (such 
as power consumption) of finite-state systems [HI El [1] . Weighted automata, however, are inherently 
nondeterministic, and are restricted to cost domains that support two operations with the algebraic 
structure of a semiring, one operation for summing up costs along a path (such as +), and one 
for aggregating costs of alternative paths (such as min). Thus, weighted automata, and other 
existing frameworks (see |11] 130]). do not provide guidance on how to combine and define costs in 
presence of multiple operations such as paying incremental costs, scaling by discounting factors, 
and choosing minimum. In particular, one cannot specify that "the cost of an event e is 10, but 
for every future occurrence of the event e', we offer a refund of 5% to the entire cost accumulated 
until e." The existing work on "generalized shortest paths" considers extensions that allow costs 
with future discounting, and while it presents interesting polynomial-time algorithms |19} 131]. does 
not attempt to identify the class of models for which these algorithmic ideas are applicable. This 
motivates the problem we address: what is a plausible definition of regular functions from strings 
to cost domains, and how can such functions be specified and effectively analyzed? 

1.2 Proposed Definition of Regularity 

When should be a function from strings to a cost domain, say the set N of natural numbers, be 
considered regular? Ideally, we wish for an abstract machine-independent definition with appealing 
closure properties and decidable analysis questions. We argue that the desired class of functions is 
parameterized by the operations supported on the cost domain. Our notion of regularity is defined 
with respect to a regular set T of terms specified using a grammar. For example, the grammar 
t := +(t,t) | c specifies terms that can be built from constants using a binary operator +, and the 
grammar t := +(t,c) | * (t,d)\c specifies terms that can be built from constants using two binary 
operators + and * in a left-linear manner. Given a function g that maps strings to terms in T, 
and an interpretation [.] for the function symbols over a domain D, we can define a cost function / 
that maps a string w to the value [<7(iu)l- The theory of tree transducers, developed in the context 
of syntax-directed program transformations and processing of XML documents, suggests that the 
class of regular string-to-term transformations has the desired trade-off between expressiveness and 
analyzability as it has appealing closure properties and multiple characterizations using transducer 
models as well as Monadic-Second-Order logic [U4 [T2l HJ [20]. As a result, we call a cost function 



1 



/ from strings to a cost domain D regular with respect to a set T of terms and an interpretation 
[.] for the function symbols, exactly when f can be expressed as a composition of a regular function 
from strings to T and evaluation according to [.]. 

1.3 Machine Model: Cost Register Automata 

Having chosen a notion of regularity as a yardstick for expressiveness, we now need a corresponding 
machine model that associates costs with strings in a natural way. Guided by our recent work 
on streaming transducers [3j H], we propose the model of cost register automata: a CRA is a 
deterministic machine that maps strings over an input alphabet to cost values using a finite-state 
control and a finite set of cost registers. At each step, the machine reads an input symbol, updates 
its control state, and updates its registers using a parallel assignment, where the definition is 
parameterized by the set of expressions that can be used in the assignments. For example, a CRA 
with increments can use multiple registers to compute alternative costs, perform updates of the 
form x := y + c at each step, and commit to the cost computed in one of the registers at the end. 
Besides studying CRAs with operations such as increment, addition, min, and scaling by a factor, 
we explore the following two variants. First, we consider models in which registers hold not only 
cost values, but (unary) cost functions: we allow registers to hold pairs of values, where a pair 
(c,d) can represent the linear function f(n) = c + n* d. Operations on such pairs can simulate 
"substitution" in trees, and allow computing with contexts, where parameters can be instantiated 
at later steps. Second, we consider "copyless" models where each register can be used at most once 
in the right-hand-sides of expressions updating registers at any step. This "single-use-restriction", 
known to be critical in theory of regular tree transducers [17) . ensures that costs (or the sizes of 
terms that capture costs) grow only linearly with the length of the input. 

1.4 Contributions 

In Section 4, we study the class of cost functions over a domain D with a commutative associative 
function ®; in Section 5, we study the class of cost functions over a semiring structure with domain 
D and binary operations © (such as min) and ® (such as addition); and in Section 6, we consider 
different forms of discounted cost functions with scaling and addition. In each case, we identify 
the operations a CRA must use for expressiveness equivalent to the corresponding class of regular 
functions, and present algorithms for computing the min-cost value and checking equivalence of 
CRAs. We summarize some interesting insights that emerge from our results about specific cost 
models. First, our notion of regularity implies that regular cost functions are closed under oper- 
ations such as string reversal and regular look-ahead, leading to an appealing symmetry between 
past and future. Second, the use of multiple registers and explicit combinators allows CRAs to 
compute all regular functions in a deterministic manner. Third, despite this added expressiveness, 
decision problems for CRAs are typically analyzable. In particular, we get algorithms for solving 
min-cost problems for more general ways of specifying discounting than known before. Fourth, 
since CRAs "construct" costs over an infinite domain, it suffices to use registers in a "write-only" 
mode without any tests. This critically distinguishes our model from the well-studied models of 
register machines, and more recently, data automata \21\ \30[ [7] : a data automaton accepts strings 
over an infinite alphabet, and the model allows at least testing equality of data values leading to 
mostly negative results regarding decidability. Fifth, it is known that weighted automata are not 
determinizable, which has sparked extensive research [26) 124) . Our results show that in presence of 
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multiple registers that can be updated by explicitly applying both the operations of the semiring, 
classical subset construction can be modified to get a deterministic machine. Finally, the class 
of regular functions over the semiring turns out to be a strict subset of functions definable by 
weighted automata due to the copyless (or linear) restriction. It is known that checking equiva- 
lence of weighted automata over the tropical semiring (natural numbers with min and addition) is 
undecidable |25} [T]. Existing proofs critically rely on the "copyful" nature raising the intriguing 
prospect that equivalence is decidable for regular functions over the tropical semiring. 

2 Cost Register Automata 

2.1 Cost Grammars 

A ranked alphabet F is a set of function symbols, each of which has a fixed arity. The arity-0 
symbols, also called constants, are mapped to domain elements. To allow infinite domains such as 
the set N of natural numbers, we need a way of encoding constants as strings over a finite set of 
symbols either in unary or binary, but we suppress this detail, and assume that there are infinitely 
many constant symbols. The set Tp of terms over a ranked alphabet is defined in the standard 
fashion: if c is a constant symbol in F, then c S Tp, and if t%, . . . € Tp and / is an arity-A; 
symbol in F, for k > 0, then f(t±, ... ,tk) G Tp. A cost grammar G is defined as a tuple (F,T) 
where F is a ranked alphabet and T is a regular subset of Tp. In this paper, we define this regular 
subset using a grammar containing a single nonterminal. In particular, we focus on the following 
grammars: For a binary function symbol +, the terms of the additive- grammar G(+) are specified 
by t := +(t,t)\c, where c is a constant, and the terms of the increment- grammar G(+c) are 
specified by t := +(t, c) | c. Given binary functions min and +, the terms of the min-inc- grammar 
G(min, +c) are given by t := mm(t,t) | + (t, c) | c, which restricts the use of addition operation. 
Given binary functions + and *, the terms of the inc-scale Grammar G(+c,*d) are generated by 
the leftdinear grammar t := +(i,c) | * (t,d) | c, that uses both operations in a restricted manner, 
and c and d denote constants, ranging over possibly different subsets of domain elements. 

2.2 Cost Models 

Given a cost grammar G = [F, T), a cost model C is defined as the tuple (G, D, [.]), where the 
cost domain D is a finite or infinite set. For each constant c in F, [c] is a unique value in the 
domain D, and for each function symbol / of arity k, [/] defines a function [/] : D fc i— > D. We can 
inductively extend the definition of [.] to assign semantics to the terms in T in a standard fashion. 
For a numerical domain D such as N (the set of natural numbers) and Z (the set of integers), we use 
C(D, +) to denote the cost model with the cost grammar (?(+), domain D, and [[+] as the standard 
addition operation. Similarly, C(N, +c) denotes the cost model with the cost grammar G(+c), 
domain N, and [+] as the standard addition operation. For cost grammars with two operations 
we denote the cost model by listing the domain, the two functions, and sometimes the subdomain 
to restrict the set of constants used by different rules: C(Q + , +c, [0, 1], *d) denotes the cost model 
with the cost grammar G(+c, *d), the set Q + of non- negative rational numbers as the domain, + 
and * interpreted as standard addition and multiplication, and the rational numbers in the interval 
[0, 1] as the range of encodings corresponding to the scaling factor d. 
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2.3 Cost Register Automata 

A cost register automaton (CRA) is a deterministic machine that maps strings over an input al- 
phabet to cost values using a finite-state control and a finite set of cost registers. At each step, the 
machine reads an input symbol, updates its control state, and updates its registers using a parallel 
assignment. It is important to note that the machine does not test the values of registers, and thus 
the registers are used in a "write-only" mode. The definition of such a machine is parameterized 
by the set of expressions that can be used in the assignments. Given a set X of registers and a cost 
grammar G, we define the set of assignment expressions E(G, X) by extending the set of terms in 
G so that each internal node can be replaced by a register name. For example, for the additive 
grammar G(+), we get the set E(+, X) of expressions defined by the grammar e := +(e, e) | c | x, 
for x £ X; and for the G(+c,*d), we get the expressions E(+c, *d, X) defined by the grammar 
e := +(e, c)\ *(e,d)\c\x, for x £ X. We assume that the ranked alphabet contains a special 
constant symbol, denoted 0, used as the initial value of the registers. 

Formally, a cost register automaton M over a cost grammar G is a tuple (E, Q, qo, X, 5, p, p) 
where E is a finite input alphabet, Q is a finite set of states, qo £ Q is the initial state, X is a finite 
set of registers, 5 : Q x E i— > Q is the state-transition function, p:QxExX4 E(G, X) is the 
register update function, and p, : Q h-» E(G, X) is a partial final cost function. 

The semantics of such an automaton is defined with respect to a cost model C = (G, D, [.]), 
and is a partial function [M, C] from E* to D. A configuration of M is of the form (q,v), where 
q G Q and the function maps each register to a cost in D. Valuations naturally map 

expressions to cost values using the interpretation of function symbols given by the cost model. 
The initial configuration is (qo,vo), where vq maps each register to the initial constant 0. Given 
a string w = a\ . . . a n £ E*, the run of M on w is a sequence of configurations (qo, z/q) . . . (q n , v n ) 
such that for 1 < i < n, <5(gj_i, dj) = qi and for each x £ X, Vi(x) = \vi-i(p{c[i-i, a, x))J. The 
output of M on w, denoted by [M, C](w), is undefined if p(q n ) is undefined, and otherwise it equals 

(K(M<?n))J- 

2.4 CRA-defmable Cost Functions 

Each cost model C = (G, D, [.]) defines a class of cost functions F(C): a partial function / from S* 
to B belongs to this class iff there exists a CRA M over the cost grammar G such that / equals 
[M, CJ. The class of cost functions corresponding to the cost model C(N, +) is abbreviated as 
F(N, +), the class corresponding to the cost model C(Q + , +c, [0, 1], *d) as F(Q + , +c, [0, 1], *d), etc. 

2.5 Examples 

Figure Q] shows examples of cost register automata for E = {a, 6, e}. Consider the cost function 
fi that maps a string w to the length of the substring obtained by deleting all b's after the last 
occurrence of e in w. The automaton M\ computes this function using two cost registers and 
increment operation. The register y is incremented on each symbol, and hence equals the length of 
string processed so far. The register x is not incremented on b symbols, but is updated to the total 
length stored in y when e symbol is encountered. This example illustrates the use of two registers: 
the computation of the desired function f\ in register x update crucially relies on the auxiliary 
register y. 

For a string w and symbol a, let \w\ a denote the count of a symbols in w. For a given string w 
of the form w\ ew2 ■ ■ ■ ew n -\ ew n , where each block Wi contains only a's and 6's, let f2{w) be the 
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Mi: CRA over M 2 : Copyless CRA M 3 : CRA over Mr- CRA over 

(+c) over (min, +c) (min, +c) ( +C) ^ 



Figure 1: Examples of Cost Register Automata 



minimum of the set {|w n _i| a; |w n _i|h, |u; n | a , |w n |&}. The CRA M2 over the grammar G(min, +c) 
computes this function using three registers by an explicit application of the min operator. 

For a given string w = w\ e w-i e . . . ew n , where each Wi contains only a's and b's, consider the 
function that maps w to min^Z'j (|tui| a + \w2\a + • • ■ \wj\a + \ w j+i\b + • • • + This function 

is computed by the CRA M3 over the grammar G(min, +c). 

The final example concerns use of scaling. Consider a computation where we wish to charge 
a cost of 10 upon seeing an a event until a b event occurs. Once a b event is triggered, for every 
subsequent e event, the cost is discounted by 5%. Such a cost function is computed by the CRA 
M4 over the grammar G(+c,*d). 

2.6 Copyless Restriction 

A CRA M is said to be copyless if each register is used at most once at every step: for each state 
q and input symbol a and each register x € X, the register x appears at most once in the set of 
expressions {S(q, a,y) \ y £ X} and x appears at most once in the output expression fj,(q). Each 
cost model C = (G,B, [.]) then defines another class of cost functions F C (C): a partial function / 
from X* to D belongs to this class iff there exists a copyless CRA M over the cost grammar G such 
that / equals [M, CJ. In Figure [TJ the automata for function /2 and f± are copyless, while the ones 
for fi and fa are not. 

2.7 Regular Look-Ahead 

A CRA M R with regular look-ahead (CRA-RLA) is a CRA that can make its decisions based on 
whether the remaining suffix of the input word belongs to a regular language. Let L be a regular 
language, and let A be a DFA for reverse(L) (such a DFA exists, since regular languages are closed 
under the reverse operation). Then, while processing an input word, testing whether the suffix 
aj . . . belongs to L corresponds to testing whether the state of A after processing . . . aj is an 
accepting state of A. We now try to formalize this concept Let w = a% . . . a& be a word over S, and 
let A be a DFA with states R processing words over E. Then the A-look-ahead labeling of w, is the 
word wa = r i r 2 ■ ■ ■ r k ° ver the alphabet R such that for each position 1 < j < k, the corresponding 
symbol is the state of the DFA A after reading ■ ■ ■ aj (it reads the reverse of the word) . A 
CRA-RLA consists of an DFA A over S with states R, and a CRA M over the input alphabet R. 
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i"(go) = x 



Figure 2: On the left a CRA-RLA over (+c) corresponding to Mi in Figure [TJ On the right 
the corresponding labeling automaton. The states of A corresponds to the languages used in the 
informal description of Mi . 

The output of CRA-RLA (M, A) on w, denoted by [(M, A),C}(w), is defined as lM,C}(w A ). In 
Figure [2] we show the CRA-RLA for Mi of Figure HJ 

3 Regular Cost Functions 

Consider a cost grammar G = (F,T). The terms in T can be viewed as trees: an internal node is 
labeled with a function symbol / of arity k > and has k children, and each leaf is labeled with a 
constant. A deterministic streaming string-to-tree transduction is a (partial) function / : E* i— >■ T. 
The theory of such transductions has been well studied, and in particular, the class of regular 
string-to-tree transductions has appealing closure properties, and multiple characterizations using 
Macro-tree-transducers (with single-use restriction and regular look-ahead) [IT], Monadic-Second- 
Order logic definable graph transformations [12] , and streaming tree transducers [4] . We first briefly 
recap the model of streaming string-to-tree transducers. 

3.1 Streaming String-to-Tree Transducers (sstt) 

A streaming string-to-tree transducer is a deterministic machine model that can compute regular 
transformations from strings to ranked trees in a single pass. We note that SSTT can be viewed 
as a variant of CRA 0, where each register stores a term, that is, an uninterpreted expression, 
and these terms are combined using the rules allowed by the grammar. To obtain a model whose 
expressiveness coincides with the regular transductions, we must require that the updates are 
copyless, but need to allow terms that contain "holes", i.e., parameters that can be substituted by 
other terms. 

Let G = (F, T) be a cost grammar. Let ? be a special 0-ary symbol that denotes a place 
holder for the term to be substituted later. We obtain the set T ? by adding the symbol ? to 
F, and requiring that each term has at most one leaf labeled with ?. For example, for the cost 
grammar G(+c), the set T ? of parameterized terms is defined by the grammar t := +(t,c) |c|?; 
and for the cost grammar G(+), the set T ? of parameterized terms is defined by the grammar 
t' := +{t, t') | + (t 1 , t) | c | ?, where t stands for (complete) terms generated by the original grammar 

To make the connection precise, we need to allow registers in CRAs to be typed, and use function symbols with 
typed signatures. We also use fi to denote the output function instead of F,. For simplicity of presentation, we defer 
this detail to a later version. 
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t := +(t,t) | c. A parameterized term such as min(5,? + 3) stands for an incomplete expression, 
where the parameter ? can be replaced by another term to complete the expression. Registers of an 
SSTT hold parameterized terms. The expressions used to update the registers at every step are given 
by the cost grammar, with an additional rule for substitution: given a parameterized expression e 
and another expression e', the expression e[e'] is obtained by substituting the sole ?-labeled leaf in 
e with the expression e! '. 

Given a set X of registers, the set E~ (G,X) represents parameterized expressions that can be 
obtained using the rules of G, registers in X, and substitution. For example, for the grammar G(+c) 
and a set X of registers, the set £ ,? (+c, X) is defined by the grammar e := +(e, c) | c | ? | x | e[e], 
for x € X. The output of an SSTT is a (complete) term in T defined using the final cost function. 
The register update function and the final cost function are required to be copyless: each register 
is used at most once on the right-hand-side in any transition. The semantics of an SSTT gives a 
partial function from E* to T. We refer the reader to [I] for details. 

3.2 Regular Cost Functions 

Let S be a finite input alphabet. Let D be a cost domain. A cost function f maps strings in S* 
to elements of D. Let C = (G,D, [.J) be a cost model. A cost function / is said to be regular with 
respect to the cost model C if there exists a regular string-to-tree transduction g from X* to T such 
that for all w 6 £*, f(w) = lg(w)}. That is, given a cost model, we can define a cost function 
using an SSTT: the SSTT maps the input string to a term, and then we evaluate the term according 
to the interpretation given by the cost model. The cost functions obtained in this manner are the 
regular functions. We use M(C) to denote the class of cost functions regular with respect to the 
cost model C. 

As an example, suppose £ = {a, b}. Consider a vocabulary with constant symbols 0, c a and q, 
and the grammar G(+c). Consider the SSTT U with a single register that is initialized to 0, and 
at every step, it updates x to +(x, c a ) on input a, and q) on input b. Given input w\ . . . w n , 
the SSTT generates the term e = +(■ ■ ■ + (+(0, ci), C2) • • • c n ), where each q = c a if Wi = a and 
q = q otherwise. To obtain the corresponding cost function, we need a cost model that interprets 
the constants and the function symbol +, and we get the cost of the input string by evaluating 
the expression e. Now, consider another SSTT U' that uses a single register initialized to ?. At 
every step, it updates x to c a )] on input o, and »[+(?, q)] on input b, using the substitution 

operation. The output is the term x[0] obtained by replacing the parameter by 0. Given input 
w\ . . . w n , the SSTT generates the term e' = +(• • • + (0, c n ), ■ ■ ■ ci), where each q = c a if uii = a 
and Cj = Cb otherwise. Note that the SSTT U builds the cost term by adding costs on the right, 
while the SSTT U' uses parameter substitution to build costs terms in the reverse order. If the 
interpretation of the function + is not commutative, then these two mechanisms allow to compute 
different functions, both of which are regular. 

Let's now consider a different grammar. Consider the constant symbols 1, c a and c&, and the 
grammar G(+c, *d). Consider the SSTT U with a single register that is initialized to 1 and, at every 
step, it updates x to c a ), 1) on input a and to q,), 1) on input b. Given input w\ . . . w n , 

the SSTT generates the term e = +(*(+(*(• • • + Q>), 1) • • • , Cn-l), 1)> c„), 1), where each q = c a 
iiwi = a and q = q otherwise. Now, consider another SSTT U' that uses a single register initialized 
to ?. At every step, it updates x to x[+(*(?,c a ), 1)] on input a, and x[+(?,q)] on input b, using 
the substitution operation. The output is the term x[l] obtained by replacing the parameter by 1. 
Given input w\ . . .w n , the SSTT generates the term e = +(*(+(*(• •• + (*(!, q,), 1) •• • , C2), 1), ci), 1), 
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where each q = c a if Wi = a and Cj = c& otherwise. The same considerations as before follow. 
3.3 Closure Properties 

If / is a regular cost function from X* to a cost domain B, then the domain of /, i.e., the set of 
strings w such that f(w) is defined, is a regular language. Closure properties for regular string- 
to-tree transductions immediately imply certain closure properties for regular cost functions. For 
a string w, let w r denote the reverse string. We define a reverse function f r : X* h > O such that 
for all u> 6 X*, f r (w) = f(w r ). Given cost functions /i, /2 from X* to D and a language L C X*, 
the choice function "if L then /i else /a" maps an input string w to fi(w) if tu G L, and to f2(w) 
otherwise. If the two cost functions fx and /2 are regular and if L is a regular language, then the 
choice function is also regular. We now show that regular cost functions are closed under reverse 
and regular choice. 

Theorem 1 (Closure Properties of Regular Cost Functions) For every cost model C, 

(a) if a cost function f belongs to the class K(C), then so does the function f r ; 

(b) if cost functions f\ and fi belong to the class E(C), then so does the function "if L then f\ 
else fi" for every regular language L. 

Proof, (a) The proof of the first statement follows from the application of theorems from [2j|3]. 
Let / be a regular cost function belonging to the class E(C), and let U be an SSTT that computes 
the function /. A streaming string transducer (sst) is a machine similar to an SSTT. It maps strings 
to strings with the help of a fixed number of registers that store strings, and uses updates that 
involve concatenating registers and strings in a copyless fashion. Given an alphabet X, computing 
the reverse w r of strings w G X is an SST-definable transduction [2j. Let SST U' that computes 
such a transduction. As proved in [4], the composition of an SST-definable transduction and an 
SSTT definable transduction is an SSTT definable transduction. Thus the sequential composition of 
V and U is also SSTT definable. Thus, for every regular function / in R(C), we can construct the 
SSTT U o U' that maps every string w to the corresponding term f(w r ), which is, by definition, the 
cost function f r [w). Thus, if / belongs to E(C), so does f r . 

(b) To prove the second statement, we show how we can construct an SSTT U c that defines 
the function "if L then fx else /2." As f\ and f2 are regular cost functions, they are definable by 
SSTT U\ and U2 respectively. The SSTT U c maintains 2 disjoint sets of registers, where the first 
set corresponds to registers of U\ and the second to the registers of U2- A state of U c is a tuple 
(q, qi, q%), where q\ and q2 exactly track the states of Ui and U2, and q is the state of the DFA 
corresponding to the regular language L. The output function of U c is defined such that if the input 
word w is in L (i.e., it is accepted by the corresponding DFA), then U c uses the first set of registers 
to compute the output, and, otherwise uses the second set of registers. For any cost model, the U c 
exactly defines the choice function. □ 

An SSTT with regular look ahead is a pair (U, A) where U is an SSTT and A a DFA. As discussed 
earlier regular- look-ahead tests allow machines to make its decisions based on whether the remaining 
suffix of the input word belongs to a given regular language. SSTT are closed under the operation 
of regular-look-ahead which implies the same for regular cost functions. 

Theorem 2 (Closure Under RLA) For every cost model SSTT with regular-look- ahead (U,A), 
there exists an SSTT U' without regular-look-ahead which computes the same function. 
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Proof. Theorem 9 in [3J. □ 



3.4 Constant Width and Linear Size of Output Terms 

While processing symbols of an input string w, in each step an SSTT performs a copyless update. 
Thus, the sum of the sizes of all terms stored in registers grows only by a constant additive factor. 
It follows that |?7(io)| is 0(|w;|). Viewed as a tree, the depth of U(w) can be linear in the length of 
w, but its width is constant, bounded by the number of registers. This implies that if / is a cost 
function in R(N, +c), then |/(tu)| must be 0(|w|). In particular, the function f(w) = \w\ 2 is not 
regular in this cost model. Revisiting the examples in Sec. El it turns out that the function fi is 
regular for C(N, +c), and the function fi is regular for C(N, min, +c). The function f% does not 
appear to be regular for C(N, min, +c), as it seems to require 0(|u>| 2 ) terms to construct it. 

4 Commutative-Monoid Cost Functions 

In this section we explore and analyze cost functions for the cost models of the form (D, ®), where D 
is a cost domain (with a designated identity element) and the interpretation [®] is a commutative 
and associative function. 

4.1 Expressiveness 

Given a cost model (D, ®), we can use regular string-to-term transductions to define two (machine- 
independent) classes of functions: the class M(D, ®c) defined by the grammar G(®c) and the class 
M(D, ®) defined by the grammar G(®). Relying on commutativity and associativity, we show these 
two classes to be equally expressive. This class of "regular additive cost functions" corresponds 
exactly to functions computed by CRAs with increment operation, and also, by copyless-CRAs 
with addition. We are going to show the result via intermediate results. 

Lemma 3 For cost domain D with a commutative and associative operation [®], M(B,®c) = 
R(D, ®). 

Proof. R(B, ®c) C R(B, ®) is trivially true. We now prove the other direction. Consider a 
function / : X* i— > D over R(B, ®) defined by the SSTT f7. In effect, C7 outputs trees over the 
grammar G(®). We show that there exists an SSTT U' that outputs trees over the grammar G(®c) 
and computes /. As shown in [3] the class of string-to-tree transductions computable by an SSTT 
coincides with the class of transductions computed by MSO transducers |T7|. Thus, for a given 
SSTT U, there is an MSO string-to-tree transducer M that computes /. 

The yield of a tree t is the string obtained by concatenating all the leaves of t as they appear 
in an pre-order search. In Lemma 7.6 of [17], the authors prove that computing the yield of a tree 
is an MSO-definable transduction. Consider the MSO tree-to-string transducer M' that computes 
the yield of trees over the grammar G(<8>). Essentially, the yield of a tree t generated by the SSTT U 
contains constant symbols. If [®J is a commutative and associative operator, the order in which the 
symbols appear is not important, and any tree t' with internal nodes ® and leaves corresponding 
to the yield of t represents an equivalent expression to the one represented by t. 

M' outputs strings over C where C is the set of constants in the output alphabet of M. Given 
a string c\. . . c n over C we want to produce the string c\ ® (c2 ® (. . . (c„ ® (0)))) that belongs to 
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the grammar G(<8>c). This transduction is clearly MSO-definable (it is a simple relabeling). Let 
M" be the MSO transducer that computes this relabeling. 

As MSO transducers are closed under composition [T7] the transduction Mf = M o M' o M" 
is also MSO-definable. Observe that for any string w 6 S*, Mr[w) is a term equivalent to the 
function f(w), but is a term over the cost grammar G(®c) (due to associativity and commutativity 
of \®\). Clearly, Mf computes the function /. As MSO-definable transductions are equivalent to 
SSTT-definable transductions, there is an SSTT U' equivalent to Mf. Thus for every / in M(D, ®) 
(defined by the SSTT U) there is an SSTT U' that defines an equivalent function in R(B, ®c). □ 

Lemma 4 For cost domain D with a commutative associative operation ®, F(D, <8>c) C M(D, <8>c). 

Proof. Consider a cost function / : E* i— > D belonging to F(D, ®c). We show that we can construct 
an SSTT [7 such that for all w G E*, p7](io) = f(w). 

Recall that a cost function in F(D, <8>c) is definable by a CRA M over (D, ®c), where M is given 
by the tuple M = (E, Q, qo, X, 5, p, p). In order to construct the desired SSTT U, we first construct 
an SSTT with regular-look-ahead, denoted by (U',A) that computes /. Here U' is an SSTT defined 
by the tuple (Q,q ,{v},6' , p' , p') and A is a DFA (R, ro,8ii) specifying the regular-look-ahead. 
Given an input string w, recall that U' reads i?-labeled words corresponding to the run of A on the 
reverse string w r . 

The final cost function p of the CRA M maps a state to a term in E{G,X). This can be 
extended to the partial function p* : Q x E* i-> E(G,X) as follows. For all q, p*(q,e) = p{q), 
and p*(q, aw) is obtained by replacing each x in p*(5(q, a),w) by the expression p(q, a, x). p*(q, w) 
gives the output of M starting in state q after reading w. For the grammar G(<8>c), it is easy to 
show by induction that for all q and w, the expression p*(q, w) contains at most one register name. 

We now describe how the RLA automaton A = (R,ro,5n) is constructed. Consider an input 
string w = w\ . . . w n , and recall that A reads the reverse string w r . At position i, we need A to 
report the register name that will contribute to the final output, i.e., the register that "flows" into 
the final output after M reads the remaining suffix Wj + \ . . . w n . To do so, each state of A is a pair 
of the form (a, \) such that a E (E U {e}) and x '■ Q ^ X U {e} is a function mapping every state 
in Q to a register name or a special empty symbol. While reading a string w\ . . . w n in the reverse 
order, the invariant maintained by a state (wi,x) of A is that if for each state q € Q, if the CRA 
M reads the symbol Wi, then the register name that flows into the final output after reading the 
string w i+ i . ..w n is x(q)- 

The initial state of A, ro = (s, xo), where Xo(l) = x if x is the (only) register name appearing 
in p(q), and is e if no register name appears in p(q). Note that the first component of the state, 
i.e., the input symbol is not used at this point as this corresponds to the case where the SSTT has 
reached the end of the string. We define Sr using the register update functions of M as follows: 

Suppose A is in state {wi + \,x) and it reads the symbol W{. We define 5r((wi+i, x)> w i) = ( w ii x')i 
where the function x' is defined as follows: 



Vg G Q s.t. 6{q,Wi) = q' , x'(q) 



e if x(q') = e or if p(q', Wi,x) = c 
V if p{q',Wi,x) = y + c. 

In the above definition, c is some constant in D. We can now define how the state transition 
function 5' and register update function p' of U' are defined: 5'(q, (a, x)) = q' if in M, S(q, a) = q' . 
In state q, U' exactly knows the register x(q) that contributes to the final output by reading the 
symbol (a, x)- Thus, it is enough for V to have just one register (denoted v). For an expression t 
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in E(G,X), let t[x h- > v] be the expression obtained by renaming the register x to v. The register 
update function p'(q, (a, x)i v ) is defined to be p(q, a, x(l))[ v x ] if xil) e > an d otherwise. 

Finally, as SSTT are closed under regular-look-ahead, there exists an SSTT U equivalent to the 
SSTT with regular-look-ahead U' . Thus for every function definable by CRA over (D, <g>c), there 
exists an SSTT J7 that computes /, which means that / is in R(B, <g>c). □ 

Lemma 5 For cost domain B with a commutative associative operation <g>, F c (B,(g>) Q F(B,(g>c). 

Proof. Consider a function / : S* D belonging to F C (B,&>). Let M = (Y,,Q,q Q ,X,S, p, p) 
be a copyless CRA over (D, g>) that computes /. We construct an CRA M' = (£, Q, go> 2 X , (5, p', p!) 
over (D, ®c) that also computes /. 

For every subset SCI, M' maintains a register denoted by xs- M' maintains the following 
invariant: If the configuration of M is (q, v), then the corresponding configuration of M' is (q, v') 
such that for all xg € 2 , v'(xs) = X^esK 2 -)- Informally, each register maintains the sum of 
the registers in the set S. 

The update function p' of M' corresponding to the updates of M can be defined as follows. 
Let us = SxeS P^Q-i a i x )- Note that the expression us is composed of two parts: an expression 
denoting the sum of register names, and a constant obtained by summing all the constants in each 
of the p(q, a, x) expressions. Let reg(us) denote the set of registers appearing in us and let co(us) 
denote the computed constant. Then, we define p'{q,a,xs) = x reg ^ Us ^ + co{us)- Note that as M 
is copyless, for every S, any register x G X appears in the expression us at most once. Also note 
that p' may not be copyless as the registers xr denoting the same subset R may appear in two or 
more expressions p'(q,a,xs)- The output function can be define in a similar fashion by defining 
the expression us to be SxesM^)' an d setting p'(q) = x reg (u s ) + co(us)- □ 

See Fig. [3] for an example of this construction. The CRA on the right of the figure will have one 
register for every possible subset of registers of the CRA on the left of the figure. Let's consider 
the transition on the symbol b. In the original CRA the update performed on x is x := x + y + z 
and all the other registers are reset to 0. This means that for all the S containing x, xs is updated 
to 

x {x,y,z} while all the other registers are reset. 

Lemma 6 For a cost domain D with a commutative and associative operation [(g)]] , M(D, (gc) C 
F c (D,0). 

Proof. Consider a function / : S* h4 B belonging to M(B,<g>c), i.e., there is an SSTT U = 
(Q, qo, X, 5, p, p) over the cost grammar G(g)c) such that \U\ = f, for the cost model (B,<g>c, [.]). 
We show how we can construct a copyless CRA M = (S, Q, qo, X, 5, p', p!) over (B,<g>) that also 
computes /. 

The CRA faithfully mimics the computation of the SSTT U in its state. The only difference 
is the register update function and the final output function. The translation ensures that M 
maintains the invariant that if a configuration of U is (q, v), the corresponding configuration for M 
is {q,v') such that for all x £ X, [f(a:[0])]] = v'{x). 

A register update expression p(q, a, x) for U has one of the following forms: x := (g(x, c), x := c, 
{x := x[y],y :=?}. Except for the last assignment, each RHS expression is in E(G{®c),X), and with 
[(gj = +, can be can be directly mimicked by M by setting p'(q, a, x) to be the expressions x := x+c 
and x := c respectively. To simulate {x := x[y],y :=?}, we note that as <g> is associative and 
commutative, for a term in E{G{®c),X), the term x[y] is equivalent to the term (g>(x, y). Thus, with 
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Figure 3: Translation from copyless CRA (+) to CRA (+c) 



\®\ = +, M can simulate parameter substitution by the copyless assignment {x := x <g> y, y := 0}. 
The output function fJ,'(q) can be mimicked in a similar fashion for the corresponding expressions 
in fi(q). □ 

Theorem 7 (Expressiveness of Additive Cost Functions) For cost domain D with a com- 
mutative associative operation ®, F(D, ®c) = F C (D, <g>) = E(B, ®c) = M(D, ®). 

Proof. Follows from Lemma I3I4I5I and [6l □ 

We can also establish the following results establishing an expressiveness hierarchy between 
different classes. First we show that the copyless restriction for CRA over (D, ®c) reduces the 
expressivity. 



Theorem 8 ¥( 



Proof. Consider the function f\ computed by the CRA M\ in Fig. [Jthat maps a string w to 
the length of the substring obtained by deleting all b's after the last occurrence of e in w. We show 
that for any fixed k, there does not exist a copyless CRA capable of computing this function. 

Assume that M is a copyless CRA over (D, ®c) that can compute this function with k registers. 
Without loss of generality we can assume that in every assignment of the form x := y + c, y and x 
are the same register (if the original machine is doing some copyless renaming, we can remember 
the renaming in the state). 

Now consider a string of the form w = b n aeb n ae . . ., where n is greater than the number of states 
in M. Thus, for each i, while processing the i^ 1 block of b n , some state qi (possibly depending 
on the block number i) must be visited at least twice. For each i, let x\ be the register used to 
calculate the output after reading the input string {b n ae) 1 a. Now observe that for all i < j, Xi 
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Figure 4: CRA implementing f)~. Each transition performs V{ := + i, for all i. 



and Xj are necessarily distinct: if we pump w to w' = (b n aeY~ l b n+l ae {b n ae) 3 1 . . ., the value of Xi 
after i blocks must be unchanged, but the value of Xj must change. We have thus established that 
a different register must be used to produce the output after each block, but this is not possible if 
the machine has only a finite number (k) of registers. □ 

We then show that removing the copyless restriction from CRAs over (D,®) is too permissive 
as it allows computing cost functions that grow exponentially. 

Theorem 9 There exists a CRA M over (D, <g>) that cannot be expressed as a copyless CRA M over 
(D,<8>), i.e., F(D,<8>) £ F C (D,®). 

Proof. We prove this result by contradiction. We create a CRA M over (D, <g>) over the alpha- 
bet {a} such that on the first a it perform the update x := 2 and on the subsequent a's it performs 
the update x := x + x. Given a string w € a + , the function computed is / = 2^1. Let's assume 
that there exists a copyless CRA M' that can compute /. By Theorem there must be an SSTT U 
in R(B, <g)c) that also computes /. However, as the output is not linearly bounded, this function 
cannot be computed by an SSTT and so we reach a contradiction. □ 

Finally we show that having multiple registers is essential for expressive completeness. 

Theorem 10 For every k £ N, there is a cost function f so that every CRA M over F(N, +c) has 
at least k registers. 

Proof. For each k, consider the function : N — > N defined as (x) = ((x mod k) + 1) .x. 
The input x is expressed in a unary alphabet X = {1}. This function outputs one of x, 2x, 3x, . . . , 
kx, depending on the length of x. 

First, these functions can be implemented by a CRA (shown in Fig. |4|). has k registers 
vi, Vk, all initialized to and has k states qo, q^-x- For each i, 5(qi,l) = qu+x) mo d fc> 
p{qi, 1, a*) =Vi+i, and fifa) = v {i+1 ) mod k . 

We now show that at least k registers are necessary. Consider otherwise, and say we are able 
to produce such an CRA M with k — 1 registers. The main idea is that the difference between 
any two "sub" -functions ax and bx, a 7^ 6, grows without bound Since M works over a unary input 
alphabet, the only form it can assume is that of a lasso (Fig. [5|). Say there are m > states in the 
initial approach to the loop, and n > 1 states in a single pass of the loop. 

In the following argument, let c denote the largest constant that appears in the description of 
M. Without loss of generality, we can assume that no register renaming occurs, as for a CRA 
with register renaming, there is an equivalent CRA with no register renaming, by tracking register 
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n > 1 states 



Figure 5: The structure of a possible k — 1-register CRA M implementing 

renaming as part of its state. Thus, if in state q, if p(q, 1, Vi) is the expression Vj + ci (where i ^ j), 
then there is no vg such that Vi appears in p(q, 1, ve). 

Also observe that no register that ever gets reset during the loop can contribute to the output 
afterwards. Say there is some transition during which vi is updated as Vi := C3. If this register 
influences the output n' > states later, then for x > en! + C3, the output must be incorrect. 

Now pick some state q occurring in the loop. Say that the output in q depends on some register 
V{. Let us call q good if register Vi in q influences the output in some state q' which is p < k 
transitions from q. At least one good state has to exist, since the machine has at most k — 1 
registers. We now use the presence of this good state to derive a contradiction: the outputs in q 
and q' can differ by no more than (p + 2).c. But since q and q' are closer than k steps apart, they 
output necessarily different functions, and hence for x > (p + 2).c, the outputs in these two states 
are required to differ by more than this amount. The contradiction is complete. □ 

We now prove some closure properties of the model. 

Theorem 11 (Addition) Given two CRAs Mi and M 2 over the cost model (Q, +), there exists 
a M over the same cost model such that Vw G £*. M(w) = M\(w) + M2(w). 

Proof. Given M, L = (T l ,Q i ,q} ) ,X i ,S i , p\ (where i € {1,2}) we construct M = {Y>,Q l x 
Q 2 , (<7q, <7q), X 1 x X 2 U X 1 U X 2 ,5,p,/j,). We assume Ai and A2 are disjoints sets. The registers 
in M are pairs of the form (x,y) or singletons of the form x. Whenever Mi, while processing an 
input word w, is in the configuration (qi,Vi), M has the configuration ((qi, 52), v) such that: (1) 
v((x, y)) = ui{x) + v 2 (y), (2) v{x) = vi{x) if x G X 1 , and (3) u(y) = v 2 (y) if y G X 2 . 

We now define the update functions <5 and p. Given (^1,^2) G Qi x Q 2 ,a G H,(xi,X2) G 
A 1 x X 2 , define <5((«i, w 2 ), a) = ((5 1 (fi, a), 5 2 (w 2 , a)), and p((^i, w 2 ), a, (xi,x 2 )) = reg(p 1 (v\, a, x\) + 
p 2 (v\,a, x 2 )) + co(p 1 (vi, a, x\) + p 2 (vi,a,x 2 )). 

Given an expression e, regie) is equal x when x is the only variable appearing in e, to (x, y) 
when x G A 1 ,?/ G X 2 are the only two variable appearing in e and e when no variable appears in 
e, while co(e) is the sum of all the constants appearing in e. Note that, at every point, at most 
2 registers can appear in the combined right-hand side. The output function can be defined in a 
similar way. By construction M computes the right function. □ 

Theorem 12 (Subtraction) Given two CRAs M\ and M 2 over the cost model (Q, +), there exists 
a M over the same cost model such that G S*. M[w) = M\(w) — M 2 [w). 
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Proof. Similar to the proof for addition. □ 



4.2 Weighted Automata 

A weighted automaton p3] over an input alphabet E and a cost domain D is a nondeterministic 
finite-state automaton whose edges are labeled with input symbols in E and costs in D. For an input 
string w, the automaton can have multiple accepting paths from its initial state to an accepting 
state. The semantics of the automaton is defined using two binary functions © and ® such that © 
is associative and commutative, and © distributes over © (to be precise, form a semiring algebraic 
structure) . The cost of a path is the sum of the costs of all the transitions along the path according 
to ©, and the cost of a string w is obtained by applying © to the set of costs of all accepting paths 
of the automaton over w. 

Let K be the semiring (D, ©, ®, 0, 1). Formally, a weighted automaton WA with weights from 
K, from an input alphabet E into the domain D is a tuple W = (E,P,I,F,E,\,p) where E is a 
finite input alphabet, P is a finite set of states, I C P the set of initial states, F C P the set of 
final states, E a finite multiset of transitions, which are elements of P x E x D x P, A : / i— > D an 
initial weight function, and p : F i— >• O a final weight function mapping P to D. 

Consider a string u> = itfi . . . w n G E* and a WA A sequence ir = (go, Co), (<?i, ci), . . . , ((/„, c„) 
is an accepting in sequence for w if, for every 1 < i < n, Wi, Ci, <&) G S and A(qo) = cq. The 

weight of 7r (denoted as w(n)) is computed as (©o<i<nCj) © p{Qn)- Given a word w we denote by 
P(u;) the set of all the accepting paths of w. The weight T(w) of the string uu is defined as: 

T( W ) = w [w] 

ireP(s) 

where © is also an operation over S. A weighted automaton is called single-valued if each input 
string has at most one accepting path@. To interpret a single-valued weighted automaton, we 
need only an interpretation for ©. Thus, we can compare the class of functions definable by such 
automata with regular additive functions. 

Theorem 13 (Single-valued Weighted Automata) A cost function f : E* i-> O is mM(D,©c) 
iff it is definable by a single valued weighted automaton. 

Proof. Let W = (E, P, I, F, E, A, p) a single valued weighted automaton that computes the 
function / : E* ^ D. We construct an SSTT with regular look-ahead (U, A) = (PU{/}, /, {v}, 5, p, //), 
(R,rQ,5n) that computes /. The SSTT U uses the cost grammar G(©c) to construct its terms. We 
then use Theorem [2] to show that there exists an SSTT that computes /, which means that / is 
regular. 

Even though W is nondeterministic, since it is single valued, it will have only one accepting 
path. We construct A = (R,ro,5n) such that the states in R give information on what is the 
next transition to take to reach an accepting path. Every state r G R is a pair (a, /) where 
a G (E U {e}) and / is a partial function from P to P. After reading the i th symbol of the input 
word w = w\ . . . w n , r = (w{, f) and f(q) = q' if: 1) (q, a, c, q') G E for some c, and 2) if W starts 
reading . . . w n in q' , it will reach an accepting state. 

2 In some of the literature these are called unambiguous weighted automata, while a single-valued weighted au- 
tomaton is one where the weights of all the accepting paths are the same. The two notions are proved to be equivalent. 
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Figure 6: Weighted Automaton corresponding to Mi in Figure Q] 



The initial state tq is defined as (e, /o)- and for every q £ F fo(q) = q. The initial state does 
not encode any information as it corresponds to the case where the U has reached the end of the 
string w. We now define Sr. Suppose A is in state (a,/) and it is reading the input b. The new 
state will be (b, /') where f'(q) = q' if /(<?') is defined and (q, b, c, q') € E for some c. 

We now define the state transition function for the SSTT U. We define 5(q, (a, /)) = g' if 
/(g) = q'. Particular attention must be made for the case when U is in state /. In this case on 
input symbol (a,/), 6(1, (a, /)) = q' where q' is the only state in / such that /(g) = q' . Notice 
that there can be only one state of this form otherwise W would not be single valued. For every 
transition of the form (q, a, c, f(q)) in W, in U, the register update function p(q, a, v) maps v to the 
expression c). This shows that for every weighted automaton W, we can construct an SSTT U 
over the cost grammar G(ffic) such that for all input strings w, jUj{w) = W(w). 

We now prove the other direction. By Theorem [7] we know that every function / in M(D,<gic) 
can be computed by a CRA M over (D,®c). Let M = (S, Q, qo, X, 5, p, fj,). We show how we can 
construct a single valued weighted automaton W = (E, Qx (IU{r)), {go} X (A^U{r}), QxX,E, A, p) 
that also computes /. 

After processing an input word w, if M has the configuration (q, u), we have that: corresponding 
to every x £ X there exists a path in from the initial state such that the cost along that path 
is equal to v(x). Let's now give the definition of E: (1) if 8(q, a) = q' and p(q, a, x) = <8>(y, c), then 
((q, y),a, c, (q',x)) E E, (2) if 5(q, a) = q' and p(q, a, x) = c, then ((</, r), a, c, (q' ,x)) G E 1 , and (3) if 
5(g, a) = q', ((q, r), a, 0, (q 1 , r)) € -E, The nodes (g, r) are always reachable with cost and are used 
to represent resets, but none of them is accepting, p can be defined in a similar way. 

A simple inductive proof establishes that in a CRA over (D, <8>c), in any state, only one register 
eventually contributes to the final output, or in other words, only one value flows to the final output. 
Thus, the constructed weighted automaton is is single-valued. □ 

An example of the translation of the function M\ of Figure [U is in Figure [U The machine has 
only two states (qo,x), and (qo,y). If we take for example the transition of the first automaton 
when reading c, we can see that x is updated to y + 1. In the automaton of Figure [6] this is reflected 
by the transition from (qo,y) to (qo,x) with label c and with weight +1. 

4.3 Decision Problems 

Minimum Costs. The shortest path problem for CRAs is to find a string w whose cost is 
the minimum. For numerical domain with addition, for CRAs with increment, we can solve the 
shortest path problem by reducing it to classical shortest paths using the translation from CRAs 
with increment to single- valued weighted automata used in the proof of Theorem [131 If the CRA 
has n states and k registers, the graph has n ■ k vertices. The exact complexity depends on the 



16 



Kio) = x +y 

M(9i) = x 
^(92) = V 

Figure 7: Example CRA over ((g)) needing less registers than CRA over (<8>c) 

weights used: for example, if the costs are nonnegative, we can use Dijkstra's algorithm. 

Theorem 14 (Shortest Path for CRAs with Inc) Given a CRA M over the cost model (Q, +c), 
computing min{M(u;) | w G T,*} is solvable in PTlME. 

Proof. We reduce the problem to shortest finding the shortest path in a weighted graph. 
Using the construction of Theorem [13] we create a weighted graph. The graph has nk nodes and 
edges where n,k are the number of states and variable of M respectively. If the weights 
are all positive we can use Dijkstra's algorithm with a final complexity of 0(|S|nfc + nk\og{nk)) 
otherwise we can use the Bellman-Ford algorithm, making the complexity 0(n 2 k 2 log(nfc)). □ 

Even though F(D, ®c) = F C (D,®), the model with addition can be more succinct (see Fig. [7] 
for an example). To solve minimum-cost problem for copyless-CRAs over the cost model (D, ®), 
we can use the translation to CRAs over (D, ®c) used in the proof of Theorem which causes a 
blow-up exponential in the number of registers. We can establish an NP-hardness bound for the 
min-cost problem by a simple reduction from 3-SAT. 

Theorem 15 (Shortest Paths for CRAs with Addition) Given a copyless-CRA M over the 
cost model (Q, +) with n states and k registers, computing m.m{M(w) \ w £ S*} is solvable in time 
polynomial in n and exponential in k. Given a copyless CRA M over the cost model (N, +) and a 
constant f£N, deciding whether there exists a string w such that M(w) < K is NP-Hard. 

Proof. The first result follows from the complexity of the translation in Lemma [5j For the 
second part we give a reduction from 3-SAT. Given an instance V = {v\, . . . , v n }, C = {ci, . . . , c^} 
where V is the set of literals and C the set of clauses we construct a CRA M over (N, +). M is de- 
fined as the tuple (£, Q, qo,X, 5, p, fj,), where X = {0, 1}, Q = {q^, . . . , q n +2}, A = {xi, . . . , x^}. The 
update functions are defined as follows: For each i, and b G S, 5(qi,b) = qi+i, and p(qi,0,Xj) = 
if the clause Cj becomes true when the variable Vj, is false. Similarly p(qi, 1, Xj) = if the clause Cj 
becomes true when is true. Finally, we define p(q n+ i,b,xi) = J2i=i x ii an d ^{qn+2) = aci- It is 
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easy to see that every path in the M corresponds to a unique valuation for the literals v\ , . . . , v n . 
Finally, if the minimum-cost computed by M is 0, then we have an instance of SAT, as there is 
a valuation of the literals that makes every clause true. If the minimum-cost computed is greater 
than 0, then the conjunction of the clauses is unsatisfiable. This means that solving min-value 
problem for a CRA over (N, +) is as hard as solving 3-SAT. □ 

Equivalence and Containment. Given two cost register automata using addition over a nu- 
merical domain, checking whether they define exactly the same function is solvable in polynomial 
time relying on properties of systems of linear equations. 

Theorem 16 (Equivalence of CRAs with Addition) Given two CRAs M\ and M 2 over the 
cost model (Q, +), deciding whether for all w, M\(w) = M^iw) is solvable in PTime. 

Proof. Given M< = (T l ,Q\q} ) , X 1 ,5\ p\ fi l ) (where i G {1,2}) we construct M = (E,^ 1 x 
Q 2 , (<Jq, 9q), AT 1 U X 2 , 5, p, p). We assume the two sets variables X\ and X2 are disjoint. 

For every (91,92) £ Q 1 x Q 2 and a G E, S((q q , I2), a) = (£1(91, a), 62(92, a)). M updates the 
two constituent sets of registers separately. For x\ G X\, p((q%, 92), «, %\) = Pi (91, a, x±), and for 
x 2 G X 2 p((qi, 92), a, x\) = p 2 (q 2 , a, x 2 ). 

We want to check if along every path of the M, and for every state (91,92), the equation 
Ml (91) = ^2(92) holds. We adapt the algorithm for checking validity of afline relations over affine 
programs presented in |28j to do this. The algorithm in [28], checks the validity of affine relations 
(equality constraints over linear combinations of real-valued or rational-valued program variables 
and constants), over affine graphs (graphs where each edge is labeled by an afline assignment). We 
can cast the equivalence check for CRAs over (Q, (g>) as a subcase of this problem. 

The algorithm propagates the equation (pi(qi) = p 2 (q 2 ) backward along each transition using 
the register update function: for an edge from 9 to q' with some label a, every equation e\ = e 2 
that must hold at v yields an equation = e' 2 that must hold at u, where the expressions e'i and 
e' 2 are obtained from e% and e 2 using substitution to account for the update of registers along the 
edge from u to v. e\ will be equal to e% where every register x G X is replaced by p(u,a,x). At 
every step of the back propagation, we compute the basis of the set of equations in every state 
using Gaussian elimination. If we reach a system of equations with no solution the two machines 
are inequivalent, while if we reach a fix point where no independent equations can be added, the 
two machines are equivalent. 

As shown in Theorem 2 of [28 , such a propagation terminates in 0(nk 5 ) where n is the size of 
the machine (in our case IQ^IQ^ISQ and k is the number of registers (in our case \X 1 \ + |X 2 |). 

For CRAs that use only increment, the cubic complexity of the Gaussian elimination in the inner 
loop of the equivalence check can be simplified to quadratic: at every step in the back propagation, 
all equations are of the form x = y + c. The final complexity is 0(nk 2 ) if we only have increments 
and 0(nk 3 ) otherwise (k,n are as defined before). □ 

We now show that general containment is also decidable in polynomial time and that checking 
if a number is in the range of a CRA over (Z, +) is decidable in polynomial time. 

Theorem 17 (Mi < M 2 ) Given two CRAs M\ and M 2 over the cost model (Q, +), deciding 
whether Mw G Yf M\{w) < M 2 (w) is in PTime. 
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Proof. We reduce the problem to shortest path. We in fact have that if Vw G ~S*.Mi(w) < M2(w) 
then Vw G Y?.M\(w) — M2(w) > 0. But from Theorem [12] we can construct an M which is 
equivalent to M\(w) — M2(w) and has polynomial size. Now we can solve shortest path on M. The 
algorithm is clearly polynomial. □ 

Theorem 18 (k G Range) Given two CRAs M over the cost model (Z, +) and a constant k G Z„ 
deciding whether 3w G Ti*.M(w) = k is in NLogSpace. 

Proof. We reduce the problem to reachability in a weighted graph. Using Theorem [12] we 
compute M' = M — c. Now we want to check 3w G T,* .M'(w) = 0. We can create the same graph 
of shortest path and look for a path on it. The problem of O-reachability over finite graphs is 
known to be in NLogSpace. □ 

5 Semiring Cost Models 

In this section, we consider the cost models which result when the cost model supports two binary 
operations, ffi and ffi, that impose a semiring structure (see subsection 14.21 for the definition of 
semiring). This structure has been studied extensively in the literature on weighted automata and 
rational power series. A specific case of interest is the tropical semiring, where the cost domain is 
NU{oo}, ffi is the min operation, and ffi is arithmetic addition. While choosing a grammar, we can 
restrict either or both of © and © to be "unary" (that is, the second argument is a constant). To 
study the tropical semiring, it makes sense to choose min to be binary, while addition to be unary. 
Hence, in this section, we will focus on the grammar G(ffi,ffic), and the class IR(B,ffi,ffic) of cost 
functions. 

5.1 CRA Models 

Our first task is to find a suitable set of operations for cost register automata so as to have 
expressiveness same as the class M(D,ffi,ffic). It turns out that (unrestricted) CRAs with © and 
©c are too expressive, while their copyless counterparts are too restrictive. We need to enforce 
the copyless restriction, but allow substitution. In the proposed model, each register x has two 
fields ranging over values from D: (x.c, x.d). The intuitive understanding is that x represents the 
expression [x.d® ?) ffi x.c where ? denotes the parameter. Such a pair can be viewed as the "most 
evaluated" form of a parameterized term in the corresponding SSTT. Expressions used for the 
update are given by the grammar 

e ::= (c, d) \ x | eiffie2 | eiffid | e\ [e^\ 

where x is a register, and c and d are constants. For the min-inc interpretation, the initial val- 
ues are of the form (oo,0) corresponding to the additive and multiplicative identities. We re- 
quire that registers are used in a copyless manner, so that any particular register x appears in 
the update of at most one register. The semantics of the operators on pairs is defined below: 
eiffie2 is defined to be (e\.c ffi ei.c, e\.d); e\®d equals (ei .c ffi d, e\ .d ffi d) ; and e\ \e-^ is given 
by {e\.c ffi e\.d © e2-c, e±.d © e2-d). While registers contain and expressions evaluate to pairs, the 
output function projects out the "c" component of this pair: this is equivalent to instantiating 
the parameter ? to 0, the additive identity, since over semirings, the additive identity annihilates 
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b/x := x,y :=y 




b/x := y,y := (oo,0) 




Ai(<?o) = (oo, oo) 
ft(qi) = (oo, oo) 
M(92) = x 



x := x (By 
y : = (oo, 0) 



Figure 8: The F c (DxD,ffi, ®, [•]) machine for example \W\ 



any other element under multiplication (x.d © x.c = 00 x.c = x.c). The resulting model of 
CRA-definable cost functions is F c (B x B, © , 0d, [■]) 

Example 19 Consider strings w £ {a, 6}*, so i/iai / (w) is the number of a's between the closest 
pair ofb 's. This function is in ¥ (B, min, +c), but not in the more restricted classes: ¥ (B, min (•, d) , +c) 
and F (B, +c). In figurelBi we show a ¥ c (B x B, , , [•]) machine that can compute f . The out- 
put in both qo and q\ is identically oo, while the output in qi is the "c" component of the ouput 
function x: x.c. 

5.2 Expressiveness 

The next theorem summarizes the relationship between functions definable by different CRA mod- 
els. The rest of the session contains the proof of this theorem. 

Theorem 20 (Expressiveness of Semi-ring Cost Functions) 7/(B,ffi,0) forms a semiring, 
then 



Proof. We first show that the containment holds and then that it is strict. Copy less CRAs with 
© and 0c can be simulated by copyless CRAs operating over pairs and performing © , , and [•]. 
Given a F C (B, ffi, 0c) machine Mi, construct an F c (B x B, © , c, [•]) machine M2 with the same 
states, and same registers. Replace every occurrence of ffi and 0c in the update expressions to ffi 
and c respectively. 

To show strict containment, let (B, 0,0) be the tropical semiring. Our witness function is f\ 
from Fig. [Q First off, observe that f\ € F C (B x B, 0, 0c, [•]), as shown in figured! We now 
demonstrate that f\ ^ F C (B, min, -t-c), and our proof is similar to that of theorem [HJ 

We proceed by contradiction. Say we are given a copyless CRA machine M over (B, min, +c) 
that implements /. Without loss of generality, we can assume that in every update, a register x is 
either reset, or appears in its own update expression: x := min (x + c, . . .). 

Consider a string of the form w = b n aeb n ae . . ., where n is greater than the number of states 
in M. Thus, for each i, while processing the i block of b n , some state qi (possibly depending 



F c (B,ffi,0c) C F c (BxB, ffi, 0c, [•]) 



M(D,e,®d) C F(B,ffi,0c) 



We split the proof into the following lemmas. 



Lemma 21 If (B, ffi, 0) forms a semiring, then 



F c (B,©,0c) C F C (B x B, ©, c, [•]) 
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Ix := x[y] <3> 1 
7 » := (oo, Oj 

H(q ) = x 

Figure 9: f\ from figure [His in F c (B x B, , <g> , [•]). 

on the block number i) must be visited at least twice. For each i, let x\ be the register which 
influences the output after reading the input string {b n ae) % a. Now observe that for all % < j, cc, 
and necessarily distinct: if we pump w to w' = (b n aef' 1 b n+k ae (b n aey 1 . . ., the value of xi 

after i blocks must be unchanged, but the value of Xj must change. We have thus established that 
a different register must be used to produce the output after each block, but this is not possible if 
the machine has only a finite number (k in this case) of registers. □ 

Lemma 22 If (B, ©, ®) forms a semiring, then 

F c (BxB, ®,®c, [■]) = R(B,©,®d). 

Proof. From the definition of CRAs the terms constructed by the SSTTs are in correspondence 
with their most evaluated versions maintained by CRAs. □ 

Lemma 23 If (B, ffi, ©) forms a semiring, then 

F c (BxB, ©, ®c, [•]) C F(D,e,®c). 

Proof. Consider a copyless CRA machine M over (B x D, © , ® , [•]). Let K be the set of its 
registers. We construct a copyful CRA M' over (B,©,©c) equivalent to M. We perform the 
following subset construction over registers. The states and transitions of M' are the same as in 
M. The set of registers V\ of M' is the following: 

1. x.c and x.d for every x € V r . 

2. for every 5 C V r , we maintain = <g> x€ sx-d, and for all x £ S, xd$ = x.c © ds- 

The expression on the right of each of the above equalities is the intended invariant we'll maintain. 
Because of the properties of the semiring, we can simplify the resulting expression into a linear 
form (an expression of the form ©j (Xj © Oj) © c, for some constants a, and c, where i ranges over 
the registers). 

We define an elementary update in M as one in which: the value of no register changes, or 
exactly two registers permute: (x, y) := (y, x), or exactly one register is reset: x := (c, d), or exactly 
one register changes: x := x©d, or exactly two registers change (addition): (x,y) := (x©y, (0, 1)}, 
or exactly two registers change (substitution): (x,y) := (x [y] , (0, 1)). Observe that any copyless 
register update can be written as a finite sequence of elementary updates. Also a finite sequence 
of updates in a CRA machine over (B,©,(g>c) can be summarized into a single update. Thus, if 
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we demonstrate a semantics-preserving transformation from elementary updates to copyful linear 
updates, we are done. 

Given a register x € Vi, let x be its value before the update, and x' be its intended value after. 
We show that x' in each case can be written as a linear combination of the old values, thus giving 
a linear update rule x := Expr. Only the last two cases are interesting: 

1. Addition: (x,y) := (x©y, (0, 1)). 

(a) x.d = x.c ffi y.c, x.d! = x.d. y.d = and y.d! = 1. 

(b) For z 7^ x,y. z.d = z.c and z.d! = z.d. 

(c) For S, x,y ^ S. d' s = d$, xd' s = {x.c ffi y.c) © d' s = xd' s © yd' s . yd' s = 0, and zd' s = zd$. 

(d) For S, x € S, but y ^ S. d' s = d$, yd' s = 0, and zd' s = zd$- (Exactly the same as the 
previous case.) 

(e) For S, x £ S, but y G S. Let 5" = S \ {y}. d' s = y.d' © d' s , = d s >. xd' s = x.d © d' s = 
(x.c © y.d) © ds> = xds* ffi yds*. zd' s = z.d © d' s = z.c © ds> = zds'. 

(f) For S, x,y G S. S' = S \ {x,y}. d' s = x.d' © y.d' © d' s , = x.d © d$' = ds'u{x}- 
zd' s = zx ' ® d' s = zd s >u{ x }. 

2. Substitution: {x,y := x [y] , (0, 1)). This shows why we needed to keep the subset registers. 

(a) x.d = x.c ffi x.d © y.c = x.c ffi yd^ x y. x.d' = x.d®y.d = ds x>y \. y.d = and y.d' = 1. 

(b) For z ^ x,y, z.d = z.c and z.d' = z.d. 

(c) For S, x,y $l S. d' s = ds, xd' s = x.d © d' s = (x.c ffi x.c? © y.c) © d$ = xds ffi ydsy-fa}- 
yd' 5 = 0, and zd' s = zds. 

(d) For S, x € S but y S. Let S' = S \ {x}. d' 5 = x.d' © d' S / = dr^yi © ds> = dsu{ y }- 
yd's = 0, and zd' s = z.d © d' s = z.c © dg u i y \ = zd S \j{ y }. 

(e) x ^ S, but y £ S. Let S' = S \ {y}. d' s = y.d' © d S ; = ds'. xd' 5 = x.d © d' 5 = 
(x.c ffi yd{ x }) © c?5' = xds' ffi yds'u{x}- ^'s = z - c ' © d^ = £-c © ds' = zds>. 

(f) Both x,y € S. Let 5' = S r \ {x, y}. d' s = x.d' © y.d' © d' g/ = dr^yi © ds> = ds'ij{x,y} = ds- 
zd's = z - c ' ® d's = z.c <g> ds = zds- 

□ 

Finally, the containment established by the above theorem is strict. 

Lemma 24 Over the tropical semiring, there exist functions in F(D,ffi,©c) which are not in 
M(B,ffi,©d). 

Proof. An example of such a function is in figure HJ Regardless of the string w, f (wb' w ^e) = 
\w\ a . Thus, in any state q, the machine has to contain, in some register x q , \w\ a + c qx . However, 
for all w and k > 0, there is some a so that \ f (wa) — \wa\ | > k. Let's identify some witness for 
this by writing a (w,k). In particular, this means that there has to be some register tracking the 
value of the function, which is distinct from the register x q tracking . We have thus established 
that at least two registers are necessary. 
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Consider a machine with two registers. For the largest constant c appearing in the description 
of the machine, consider the string a c o~ (a c , c). At this point, we have two registers - one containing 
the number of as, and the other containing the function. If we now feed the machine a suffix 
5k( e > c )l C) the value of the function is equal to the number of as in the input string. The machine 
now has two choices: either copy the value, or choose to track both the functions and the number 
of as in the same register. This latter choice cannot happen: for we can then feed the suffix 
a (a (e, c) 6l CT ( e ' c )le) 6l f7 ( cr ( e ' c ) b ' ( ' " e )le, and force the machine into making a mistake. 

For multiple registers, we perform a multi-step pumping argument similar to the above. Define 
the sequence: o\ = a c a(a c ,c), a s = aia c b^ aiaC ^e, <7j+i = o"ja c 6' ,7iaC 'e, ... The argument 
involves observing after reading each Uj, the number of "useful" variables, in the absence of copyful 
assignments, decreases by one: <7i<72 • • • Cj • • •• D 

5.3 Relation to Weighted Automata 

In Section B~2| we noted that single- valued weighted automata correspond exactly to CRAs with ad- 
dition. Now we show that nondeterministic weighted automata and (deterministic) CRAs (without 
the copyless restriction) with © and ©c express exactly the same class of functions. The trans- 
lation from weighted automata to CRAs can be viewed as a generalization of the classical subset 
construction for determinization. 

Theorem 25 (Weighted Automata Expressiveness) If (D, ffi, ©) forms a semiring, then the 
class of functions F(D,©,©c) is exactly that representable by weighted automata. 

Proof. Let W = (E, P, I, F, E, A, p) be a weighted automaton. We construct the corresponding 
CRA M over (D, ffi, © c): The set of states Q = 2 P . The state set is obtained by the standard subset 
construction. The intuition is that M is in state q C P after processing a string w if q is exactly 
the set of states reached in W after processing w. Thus, 8 (g, a) = {p' G P \ Bp G q,p — >yy p'}. The 
initial state go is the set of initial states of the weighted automaton, /. The set of registers is 
X = {x p | p € P}: there is a register x p for every state p G P. The following is the intuition behind 
these registers: consider some state p, and all paths from the set of initial states / to p. Along 
each path, take the ©-product of the weights, and ©-add the values thus obtained for all paths. 
The intent is for x p to hold this value. For each state p € J, the register x p is initialized to A (p). 
Even though in the definition of CRAs, registers were initialized to (or some other constant), by 
simply adding a new initial state which explicitly initializes registers before use, we can simulate 
registers being initialized to anything we choose. When the CRA makes a transition q — s> a g', the 
register update is given by \/p' G q', x p > := ffi{x p © c | p -^-w p'}- That is, to obtain the value of 
x p ', we consider each state p such that there is an a-labeled transition from p to p' with cost c in 
the weighted automaton, add c to x p according to ©, and take © over all such values. In state 
q C P, the output function is defined as © {x p © p(p) \ p G g}. The output function is the ©-sum 
of all the product of all paths. To prove the correctness of this construction, observe the inductive 
invariant: for all w, and for all states p, the register x p in the CRA M contains the ©-sum of the 
©-product of the weights of all paths leading from some initial state to p on to. This depends on 
the distributivity of © over ©. Note that the same register x p contributes to all x p >s for all its 
a-successor states p' . Thus, the update is not necessarily copyless. 

In the reverse direction, let M be a CRA over (B,©,©c) with states Q and registers X. Con- 
struct the following weighted automaton W: 
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1. The state set P is Q x X U Q. Intuitively, a state (q,x) £ Q x X calculates transformations 
happening to individual registers, and states q E Q calculates the constant offset possibly 
imposed by the output function. Formally, after processing some word w, if M reaches state 
q, then the value of register x is equal to the value reaching state (q, x): along each path from 
some initial state to (q,x), multiply all the weights, and add the values thus obtained along 
all such paths. Also, such paths exist iff processing w takes M to state q. 

2. The initial states are / = {qo} x X U {qo}- All initial weights are equal to the multiplicative 
identity. 

3. Say the output function at q E Q in M is given by p q (x) = ©j (x- L <g) ai) © c q . Then, for each 
i, the output weight p (q, x%) = and p (q) = c q . 

4. For every transition q — > a q' in M, create the transition q — } a < q' in W. Also, for every 
variable update x := ffi« (xi ® aj)©c x that occurs during this transition, create the transitions 
(q,xi) -^- a ' ai (q',x) in W (for each i). Also add the transition q — ^ a > c * (q',x). That the 
intended invariant is maintained follows from the distributivity properties of a semiring. 

□ 

5.4 Decision Problems for Min-Plus Models 

Now we turn our attention to semirings in which the cost domain is a numerical domain such as 
N U {oo}, © is the minimum operation, and © is the addition. First let us consider shortest path 
problems for CRAs over the cost model (Q U {oo}, min, +c). Given a CRA over such a cost model, 
we can construct a weighted automaton using the construction in the proof of Theoreml25i Shortest 
paths in a weighted automaton can be solved in polynomial-time using standard algorithms [26J. 

Theorem 26 (Shortest Paths in CRAs over min and +c) Given a CRA M over the cost 
model (Q U {oo} ,min, +c), computing min{M(u>) | w E X*} is solvable in PTlME. 

It is known that the equivalence problem for weighted automata over the tropical semiring is 
undecidable. It follows that checking whether two CRAs over the cost model (N U {oo} , min, +c) 
compute the same cost function, is undecidable. The existing proofs of the undecidability of 
equivalence rely on the unrestricted non-deterministic nature of weighted automata, and thus on 
the copyful nature of CRAs with min and +c. We conjecture that the equivalence problem for 
copyless CRAs over (N U {oo} , min, +c), and also for the class M(N U {oo} , min, +c) is decidable. 

6 Discounted Costs 

In this section, we focus on the class of regular cost functions definable using +c and *d. Such cost 
functions allow both adding costs and scaling by discount factors. 

6.1 Past Discounts 

First let us focus on CRAs over the cost model C (Q, +c, *d). At every step, such a machine can 
set a register x to the value d*x + c: this corresponds to discounting previously accumulated cost 
in x by a factor d, and paying an additional new cost c. We call such machines the past- discount 
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CRAs (see of Figure Q] for an example). Note that the use of multiple registers means that this 
class of cost functions is closed under regular choice and regular look-ahead: the discount factors 
can depend conditionally upon future events. It is easy to check that the cost functions definable 
by past-discount CRAs belong to the class K(Q, +c, *d). Our main result for past-discount CRAs 
is that the min-cost problem can be solved in polynomial-time. First, multiple registers can be 
handled by considering a graph whose vertices are pairs of the form (g, x), where q is a state of the 
CRA, and x is a register. Second, classical shortest path algorithm can be easily modified when 
the update along an edge scales the prior cost before adding a weight to it, this is sometimes called 
generalized shortest path (see [6j 31 J ) . 

Theorem 27 (Shortest Paths for Past Discounts) Given a past-discount CRA M over the 
cost model (Q, +c, *d), computing mm{Al(w) \ w G £*} is solvable in PTlME. 

Proof. We reduce this problem to the generalized shortest path problem (see [HEI]) on a graph, 
where edges are parameterized by cost c(e) and weight w(e) and the cost of a path p = (ei, . . . , e n ) 
is c(p) = c(ei) + w(ei) * (c(e2) + w{e2) *(... + w(e n _i) * c(e n ))), while the weight of p is w{p) = 
w(ei) * ... * w(e n ). 

Consider a past-discount CRA M = (£, Q, go, A, 5, p, //), without loss of generality, assume that 
for each q,q' G Q, a G S, if 5(q,a) = q' , then for all y G A, p(q,a,y) has the form dx + c for 
some x G X; and for all q G Q, if /i(g) is defined, then it has the form dx + c for some x G X. 
We construct the following graph G = (V,E). For each q G Q and x G X, G has a vertex (g, x). 
Moreover, G has a source vertex s, a target vertex t. The graph G maintains the invariant that 
there is a path from s to (g, x) if and only if there is a run of M such that the value of x in state q 
"flows" to the final output. Formally, for each q,q' G Q and a G S, such that S(q,a) = q' , for any 
register y G A: if /?(g, a, y) = dx A- c, for some x, G has an edge e from (g', y) to (g, x) with weight 
w(e) = d and cost c(e) = c. 

Finally, for all q G Q, if /i(g) = dx + c, G has an edge e from s to (g, x) with weight u>(e) = d, 
cost c(e) = c; for each x G A, G has an edge e from (go, x) to t, with weight w(e) = 0, cost c(e) = 0. 

Given a run (go, ^o) • • • (Qn, Vn) of M on input string u> = w\ . . . w n , if the output is defined, 
we claim that there is a s — t path p in G such that the cost of p is equal to the output of M 
on w. For convenience, let's define rj be the register that contributes to final output on state qi, 
i.e. p(q n ) = dr n + c; p(qi,Wi,ri + i) = dri + c for all i < n. Let's define /j(rj) be the function that 
outputs on state i.e. f n (r n ) = KQn), and for i < n, /j(rj) = f i+1 (p(qi,Wi,r i+1 )). It's clear that 
/o(0) is the final output of M on w. Let pi be the path (s, v n , . . . , fj), where v j = (qj,rj). It is easy 
to see that the cost of po is equal to the cost of the path p = (s,v n , . . . ,vo,t). We inductively prove 
that fi(ri) = w{pi)r; L + c(pi). In the base case, if p(q n ) = dr n + c, by construction, c(s, v n ) = c and 
w(s,v n ) = d. Therefore, / n (?"n) = dr n + c = w(p n )r n + c{p n ). Suppose, for all i > k, the inductive 
invariant holds. If p(qk-i,w k ,r k ) = dr k _ x + c, then h-x^k-x) = fk(dr k -i + c) = w(p k )(dr k _i + 
c) + c(p fc ) = dui(p fe )r fc _i + w(p fc )c + c(p k ) = w{p k ^ 1 )r k -i + c(p fc _i), since w{v kl v k ^{) = d and 
c{v k ,v k -\) = c. Therefore, the output of M on w /o(0) = c(po) = c{p). On the other hand, given 
an s—t pathp = (s, (q n , x n ), . . . , (go, ^o), *)> ^ is easy to see that there exists a run (go, vq) . . . (q n , v n ) 
of M on some string w, that the output of M is equal to c(p). □ 

6.2 Future Discounts 

Symmetric to past discounts are future discounts: at every step, the machine wants to pay an 
additional new cost c, and discount all future costs by a factor d. While processing an input 
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w± . . . w n , if the sequence of local costs is c±, . . . c n and discount factors is di, . . . d n , then the cost 
of the string is the value of the term {c\ + d\ * (c% + d2 * (■ ■ ■ )))• Future- discount CRAs are able to 
compute such cost functions using registers that range over Q x Q and substitution: each register 
holds a value of the form (c, d) where c is the accumulated cost and d is the accumulated discount 
factor, and updates are defined by the grammar e := (c, d)|e[c, d]\x. The interpretation for e[c,d] is 
defined to be (e.c + c* e.d, e.d* d) (that is, the current discount factor e.d is scaled by new discount 

d, and current cost e.c is updated by adding new cost c, scaled by the current discount factor 

e. d). Like past-discount CRAs,future-discount CRAs are closed under regular choice and regular 
look-ahead. Processing of future discounts in forward direction needs maintaining a pair consisting 
of cost and discount, and the accumulated costs along different paths is not totally ordered due to 
these two objectives. However, if we consider paths in "reverse", a single cost value updated using 
assignments of the form x := d * x + c as in past-discount CRAs suffices. 

Theorem 28 (Shortest Paths for Future Discounts) Given a future- discounted CRA M over 
the cost model (Q, +c, *d), computing mm{M(w) \ w G X*} is solvable in PTlME. 

Proof. We reduce this problem to the generalized shortest path problem (see [SI [31]) on a graph. 

Consider a future-discount CRA M = (£, Q, qo, X, 5, p, p). First we construct an equivalent 
future-discount CRA M' that every updating function and output function has the form x[c, d]. 
Formally, M' = (E,Q,qo, X' ,8, p' , //), where X' = X U {e}, such that e g" X. For each q,q' G Q, 
and a G S, such that 5(q, a) = q', for each y G X: 

1. if p(q, a, y) = x[c, d] for some x, p'(q, a, y) = x[c, d}\ 

2. if p(q, a, y) = (c, d), p'(q, a, y) = e[c, d]; 

3. p'(q,a,e) = e(0, 1). 

For each q G Q, p'(q) = e(c,d) if p{q) = (c,d), p'(q) = p(q) otherwise. Second, we construct the 
graph G = (V, E) with source s and target i, such that there is a path p from s to v if and only if 
there is a run (go, ^o) • • • (<ln, v n ) of M' and some such that v n (x) = (c(p),w(p)). Formally, 

for each q S Q, and each x G X', G has a vertex (q,x). Moreover, G has a source vertex s, a 
target vertex t. For each q,q' £ Q and a E E, such that 5(q,a) = q', for each register y £ X: if 
p'(q,a,y) = x[c, d] for some x, G has an edge e from (qsx) to (q',y), with weight ui(e) = d, cost 
c(e) = c; Finally, for each q G Q, if = x[c, d], G has an edge e from (q,x) to t with weight 
w(e) = d, cost c(e) = c; for each x € X, G has an edge e from s to (q^s), with weight w(e) = 1, 
cost c(e) = 0. 

Given a run (qo, z^o) • • • (<Zn, ^n) of M' on input string w = w\ . . . w n , if the output is defined, 
we claim that there is a s — t path p in G such that the cost of p is equal to the output of M' on 
w. Let's define rj be the register that contributes to final output on state qi, i.e. p'{q n ) = r n [c,d]; 
p' \qi,Wi,ri + i) = ri[c,d] for all i < n. Let pi be the path (s, t>o, • • • , v i), where Vj = (qj,rj) and 
p = (s, Do, ... j v n ,t). For simplicity, we abuse the notation of the name of register and functions to 
mean their values under interpretation. We inductively prove that rj = (c(pi),w(pi)). In the base 
case, ro = (0,1) = (c(po),w(po)) by construction. Suppose, for all i < k, the inductive invariant 
holds. If p(q k , w k ,r k+ i) = r k [c, d] , then r k +i = (r k .c+c*r k .d,r k .d*d) = (c(p k )+c*w(p k ),w(p k )*d) = 
( c (Pfc+i), w (Pk+i)), since by construction, c(v k , v k+ i) = c and w(v k ,v k+ i) = d. Therefore, the output 
of M' on w p'(q n ) = r n [c,d] = (r n .c + c*r n .d,r n .d*d) = (c(p n )+c*w(p n ),w(p n )*d) = (c(p),w(p)), 
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since by construction c(v n ,t) = c and w(v n ,t) = d. On the other hand, given an s — t path 
P = ( s ) (<7o> x o), • • • ; (in, x n ),t), it is easy to see that there exists a run (go, ^o) • • • (<Zn> ^n) of M' on 
some string w, that the output of M' is equal to c(p). □ 

6.3 Global Discounts 

A global- discount CRA is capable of scaling the global cost (the cost of the entire path) by a 
discount factor. As in case of future-discount CRAs, it uses registers that hold cost-discount 
pairs. We now assume that discounts range over [0, 1] and costs range over Q + . The registers are 
updated using the grammar e := (0, 1) | e+ (c, d) \ x. The interpretation for e + (c, d) is defined to 
be (d * e.c + e.d * c, e.d * d) (that is, the current discount factor e.d is scaled by new discount d, 
and current cost e.c is updated by first scaling it by the new discount, and then adding new cost 
c scaled by the current discount factor e.d). Analyzing paths in a global-discount CRA requires 
keeping track of both the accumulated cost and discount. We can show a pseudo-polynomial upper 
bound; it remains open whether there is a strongly polynomial algorithm for shortest paths for this 
model: 

Theorem 29 (Shortest Paths for Global Discounts) Given a global- discount CRA M over 
the cost model (Q + ,+c, [0,1], *d) and a constant K E Q + , deciding min{M(u>) | w S X*} < K is 
solvable in NP. Computing the minimum is solvable in PTime assuming increments are restricted 
to adding natural numbers in unary encoding. 

Proof. Consider a global-discount CRA M = (£, Q, qo, X, 5, p, fi) , we construct a graph G = 
(V, E), that each edge e G E is parameterized with a cost c(e) and a discount d(e)) as we did above. 
For a pathp = (ei, . . . , e n ), the cost of the path is defined as c(p) = (c(ei), d(e±)) + (c(e2), dfa)) + • • • 
+ (c(e n ), d(e n )) = Y^i c(ej) n« d(e»). Following the proof above, it is easy to see solving the shortest 
path in M is equivalent to solving the shortest path in G. 

To prove the NP bound, we first observe that if there is a reachable cycle that contains an edge 
e with d(e) < 1, then repeating this cycle drives the global discount to 0, and thus, existence of 
such a cycle implies that the min-cost (at the limit) is 0. Notice that the shortest path doesn't need 
to involve a cycle in which all discount factors are equal to 1 (since costs are non- negative) . The 
NP-bound follows from following fact. We can write an NP algorithm that guesses: 1) a reachable 
cycle and verifies if there is an edge e with d(e) < 1 in this cycle, or 2) a simple path and verify 
if its cost is less than K. Suppose incremental costs q's are small natural numbers. The pseudo- 
polynomial algorithm for this case relies on the following idea: for a given value c and a vertex v, 
computing the "best" global discount over all paths from source to v with sum of incremental costs 
equal to c, can be solved by adopting shortest path algorithms, and the set of interesting choices 
of c can be bound by nb for a graph with n vertices if each increment is a number between to b. 
Thus a variation of Bellman- Ford suffices (See algorithm [1]) . □ 

6.4 Regular Functions for Inc-Scale Model 

The class of regular functions for the cost model (Q, +c, *d) is defined via SSTTs over the inc-scale 
grammar G(+c,*d). It is to show that: 

Theorem 30 (Expressiveness of Inc-Scale Models) The cost functions definable by past- discount 
CRAs, by future- discount CRAs, and by global- discount CRAs all belong to M(Q, +c, *d). 
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Algorithm 1 Shortest Path Algorithm on Global Discount CRA 

/ /Discount[i, v, c] stores the best global discount from s to v among paths with length < i, when 
the sum of incremental cost is c. 

Discount[0, v,c] =00, for every node v and < c < nb 
Discount [0, s, 0] = 1 
for i = 1 to n do 

for all v in G and < c < nb do 

Discount[i, v, c] = mm{Discount[i — 1, v, c], mm e= ( u v -j{Discount[i — 1, u, c — c(e)] * d(e)}} 

end for 
end for 

/ / checks if there is a cycle with discount < 1 

if 3v , c, s.t. Discount[n — l,v, c] 7^ Discount[n,v,c] then 

return 
end if 

/ / Output the best simple path 

return mmQ <c < n b{Discount[n — l,t, c] * c} 



The min-cost problem for this class of functions is still open. However, we can show the equiv- 
alence problem to be decidable. First, using the construction similar to the one used to establish 
R(0,©,<S>c) C F(D, ©,(8>c) (see Theorem I20p . we can represent cost functions in K(Q, +c, *d) 
using (copyful) CRAs that use + and *d. Such CRAs have linear updates, and the algorithm for 
checking equivalence of CRAs with addition can be used for this case also. 

Theorem 31 Given two functions f±, fa 6 M(Q, +c, *d) represented by SSTTs over the cost gram- 
mar G(+c,*d), checking whether the two functions coincide, can be solved in time polynomial in 
the number of states and exponential in the number of registers. 

Proof. Given an SSTT T for / £ M(D, +c, *d), we use lemma [23] to construct an equivalent CRA 
M using + and *d. M has the same number of states as T, and an exponential number of variables. 
We claim that checking equivalence between two CRAs with + and *d is solvable in time polynomial 
in the number of states and number of variables (the proof is similar to that of theorem 1 16|) . Thus, 
checking the equivalence of two functions f\ and /2 expressed as STTs over G(+c, *d) can be done 
in time polynomial in the number of states and exponential in the number of variables. □ 

7 Related Work 

Weighted Automata (wa) and Logics. Finite-state WA have been an active area of research, 
with numerous articles studying their algebraic and algorithmic properties. See [2] for a compre- 
hensive exposition. An important problem for WA is that of determinization \26\ I24j. A determin- 
istic WA is defined in the usual sense: no two outgoing transitions from a state share the same 
input label. It has been shown that there are WA that do not admit equivalent deterministic WA. 
In contrast, the cost register automata that we introduce in this paper are deterministic machines 
with equivalent expressive power and equally efficient decision problems as weighted automata. We 
believe that this makes them a more suitable model for expressing weighted computations. 
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It has been shown that the equivalence problem for WA over the tropical semiring is undecidable 
using a reduction from Hilbert's tenth problem [25], and by a reduction from the halting problem 
for two counter machines [1] . The only known class of weighted automata over the tropical semiring 
with decidable equivalence problem is that of finite- valued weighted automata [33]. For a given k, 
a weighted automaton is said to be /c-valued if the number of distinct values computed along all 
accepting paths is at most k. A weighted automaton is called finite- valued if there exists a k such 
that it is fe-valued. We conjecture that the equivalence problem for CRA over min and +c with the 
copyless restriction is decidable. If this is true, it would give the largest known class with decidable 
equivalence. In [23] the authors provide a randomized algorithm to solve equivalence of weighted 
automata over the semiring with addition and scaling. 

In [13], the authors discuss a weighted MSO-logic that disallows universal second order quan- 
tification and places restrictions on universal first order quantification. The authors show that the 
formal power series definable in this logic coincides with the set of behaviors of weighted automata. 
In contrast, in this paper, we introduce automata and machines that exactly capture MSO-definable 
cost functions. 

Discounted weighted computations and Generalized Shortest Paths. Generalized net- 
work flow problems extend flow problems on directed graphs by specifying multipliers on edges in 
addition to costs [19} [3l] . The problem of finding the minimum cost flow (which in some cases is 
equivalent to the shortest distance path) from a source to a target can be solved in polynomial 
time |3i4 E]. Future discount machines that we introduce in this paper provide a nice formalism 
that subsumes such problems, and have strongly polynomial time algorithms for determining the 
minimum cost path. In this paper, we also introduce past discount and global discount machines 
that also have efficient algorithms for determining the minimum cost paths. 

In [15], the authors introduce weighted logic for infinite words. In order to address convergence 
of the weighted sum, the authors assume discounting along later edges in a path (i.e., future 
discounting) . Extending the results of this paper to discounted weighted computations over infinite 
words remains open. 

Transducer Models. A wide variety of different models have been proposed to model string and 
tree transductions. The models that are most relevant to this paper are MSO-definable transduc- 
tions |12t 116] and macro tree transducers |18} 117], An MSO-definable graph transduction specifies 
a function between sets of graphs; the nodes, edges and labels of the output graph are described 
in terms of MSO formulas over the nodes, edges and labels of a finite number of copies of the 
input graph. A macro tree transducer (MTT) is a top-down tree to tree transducer equipped with 
parameters. Parameters can store temporary trees and append them to the final tree during the 
computation. In general, MTT are more expressive than MSO-definable tree transductions. A 
subclass of MTTs obtained by restricting the number of times a subtree and a parameter can be 
used has been shown to be equi-expressive as MSO-definable tree transductions [IT]. In addition to 
these models, formalisms such as attribute grammars [17], attribute tree transducers [8] have also 
been studied. 

Streaming tree transducers [I] (STTs), introduced by two of the co-authors in this paper are a 
new formalism for expressing MSO-definable tree-to-tree and string-to-tree transductions. In com- 
parison to some of the transducer models discussed above, STTs have distinguishing features that 
make them desirable as a canonical model for specifying regular or MSO-definable transductions: 
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(1) STTs produce the output in linear time by performing a single pass over the input, (2) they 
preserve desirable properties such as closure under sequential composition and regular look-ahead, 
and (3) they have good algorithmic properties such as decidability of functional equivalence. 

Regularity over Data Languages. Data languages allow finite strings over data values that 
can be drawn from a possibly infinite data domain [30], [2T] [7]. Register automata are often used 
as acceptors for data languages. A key feature of such automata is that they allow registers to store 
and test data values. Beyond the similarity in nomenclature, register automata that are studied in 
this line of work are quite distinct from cost register automata introduced in this paper. The former 
are essentially defined over an infinite input alphabet, and the critical difference lies in the fact that 
almost every variant of data automata allows testing equality of data values, which mostly causes 
interesting decision problems to become undecidable. Cost register automata use the cost registers 
in a strictly write-only fashion, which makes them incomparable to variants of data automata that 
use read/write registers. 

Regular Cost Functions. In [10J, Colcombet defines a regular cost function as a mapping from 
words to N w (the set of nonnegative integers and the ordinal uj). A cost function is precisely defined 
as an equivalence class over mappings from the set of words to N w , such that functions / and g 
are in the same equivalence class if for all words w, f(w) is bounded by some constant iff g(w) is 
bounded is bounded by some constant. The author then defines two classes of automata (B- and 
S-automata) , each of which uses a finite set of counters and allows the counters to be incremented, 
reset or checked for equality with a constant. The operational semantics of these automata are 
that the automaton computes the least upper bound or the greatest lower bound over the set of 
counter values encountered during its run. A cost function is then called regular if it is accepted by a 
history-deterministic B- or S-automaton. The author also provides an algebraic characterization of 
regular cost functions in terms of stabilization monoids and equates recognizability of cost functions 
with regularity. In [TT], the authors extend this notion to regular cost functions over trees. 

It is clear that the notions proposed in this line of work are orthogonal to our characterization 
of regularity of cost functions. The authors state that the motivation for the work in |10t [TT] 
is preserving nice algorithmic and closure properties of regular languages for problems such as 
equivalence and projection. However, the integer values in these functions are considered modulo 
an equivalence which preserves existence of bounds on the function values, but not the values 
themselves. We believe that the notion of regularity of cost functions that we propose in this paper 
is closer to the classical notions of regularity such as MSO-definability. 

AfRne Programs. In [22J, and more recently in [28, 29J, the authors present the problem of 
deriving affine relations among variables of a program. An affine relation is a property of the form 
ao + Y27=i a «- v i = 0j where vi, . . . , v n are program variables that range over a field such as the 
rationals or reals and a% are constants over the same domain. An affine program is a program 
with nondeterministic branching where each edge of the program is labeled with an assignment 
statement of the form v\ := V2 + 2.v% + 3, i.e., where the RHS is an affine expression. We could 
define a CRA over the cost model C(Q, +, Q, *d), with the cost grammar t := +(t,t) | * (t,d) \ c. 
An affine program is then simply obtained by ignoring the input labels of the transitions in such a 
CRA. While the cost functions defined by such CRA do not have interesting regularity properties, 
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(b) Complexity of Decision Problems 



Figure 10: Summary of Results 



we remark that the equivalence of such CRA can be checked in polynomial time by using ideas 
similar to the ones in |28|. 



Quantitative Languages. A quantitative language [91 [TJ [5] over infinite words is a function 
T, w i-> R. Such languages are generated by weighted automata, where the value of a word w is set 
as the maximal value of all runs over w. By defining various value functions such as Max, Sum, 
LimSup, Limlnf, different values can be computed for the run of a weighted automaton on the 
string uu. Quantitative languages use the fixed syntax of weighted automata, and thereby restricted 
to having a single weight along each transition in their underlying automata. Moreover, they face 
similar difficulties in determinization: for interesting models of value functions, the corresponding 
automata cannot be determinized. An extension of CRA to ^-regular cost functions could prove 
to be a more expressive and robust model to specify quantitative languages and to analyze their 
decision problems. 



8 Conclusions 

We have proposed a new approach to define regular functions for associating costs with strings. 
The results for various classes of functions are summarized in Figure 2. We hope that our work 
provides new insights into the well-studied topic of weighted automata, and opens a whole range of 
new problems. First, it is plausible that there is a compelling notion of congruences and canonicity 
for CRAs with increment. Second, the decidability of copyless-CRAs with min and increment 
remains an intriguing open problem. Third, we don't have algorithms for the min-cost problem for 
the class of regular functions with increment and scaling. While we have not succeeded even in 
establishing decidability, we suspect that this problem admits efficient approximation algorithms. 
Fourth, we have considered only a small set of combinations of operations; studying the effects 
of adding operators such as max would be worthwhile. Fifth, our notion of regularity and cost 
register automata for mapping strings to costs can be extended to infinite strings and trees, as well 
as to timed and probabilistic systems. Finally, we would like to explore practical applications: our 
framework seems suitable for expressing complex, yet analyzable, pricing policies, say, for power 
distribution. 
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